Back

BMC Medical Genomics

Springer Science and Business Media LLC

Preprints posted in the last 7 days, ranked by how well they match BMC Medical Genomics's content profile, based on 36 papers previously published here. The average preprint has a 0.05% match score for this journal, so anything above that is already an above-average fit.

1
Liver biopsy confirms precise and efficient correction of SERPINA1 after in vivo Base Editing in a Patient with Alpha-1 Antitrypsin Deficiency

Krooss, S. A.; Yang, T.; Yuan, Q.; Drick, N.; Sgodda, M.; Held, J.; Behrendt, P.; Hartleben, B.; Koczulla, R.; Ma, X.; Liu, Y.; Wedemeyer, H.; Janciauskiene, S.; Di Donato, N.; Cantz, T.; Wang, E.; Wu, Y.; Hoeper, M.; Xia, Q.; Ott, M.

2026-06-09 genetic and genomic medicine 10.64898/2026.06.01.26354551 medRxiv
Top 1%
0.8%
Show abstract

Background: Alpha-1 antitrypsin deficiency (AATD) caused by the PI*ZZ mutation (Glu342Lys) results in hepatic accumulation of misfolded AAT-Z protein and reduced circulating AAT levels, leading to progressive liver disease and emphysema. Gene correction therapy represents a potentially curative approach by directly correcting the underlying genetic defect. We report the first case of successful hepatic gene correction with early histological and functional assessment. Methods/Case presentation: We report the case of a 66-year-old male patient with PI*ZZ AATD who underwent gene correction therapy within the YOLT-202 phase I/Ia clinical trial (clinical trial.gov ID NCT07193615). Ten weeks post treatment a liver biopsy was performed to re-evaluate pre-existing F2 liver fibrosis as measured by elastography before entering the study. Serum samples allowed functional assessment of the AAT-mediated elastase inhibition. Results: Liver biopsy did not show signs of hepatic inflammation and demonstrated 54% (Sanger) and 57% (Illumina) gene correction rate of the PI*ZZ variant on the DNA level with no bystander edits or off-target effects. Following a transient elevation of transaminases during the early post-treatment period, liver enzymes normalized. Monthly serum AAT measurements demonstrated biologically active and stable therapeutic levels throughout follow-up. Conclusions: This case demonstrates efficient and precise hepatic gene correction without concerning histological alterations and with substantial improvement of functional parameters, supporting the feasibility and safety of gene editing approaches for AATD.

2
Dissecting the functional landscape of rare diseases through genomic variation in a heterogeneous cohort of 11,000 patients

Uria-Regojo, G.; Fernandez-Caballero, L.; Lopez-Alcojor, A.; Lopez-Lopez, L.; Benitez, Y.; Rodilla, C.; Avila Fernandez, A.; Trujillo-Tiebas, M. J.; Osorio, A.; Corton, M.; Almoguera, B.; Ayuso, C.; Minguez, P.

2026-06-11 genetic and genomic medicine 10.64898/2026.06.10.26355349 medRxiv
Top 1%
0.7%
Show abstract

Rare diseases (RDs) remain a major diagnostic challenge. Genetic and phenotypic heterogeneity, incomplete knowledge of disease mechanisms, and limitations in variant clinical interpretation leave many patients without a molecular diagnosis. Meanwhile, the growing volume of genomic data generated in clinical practice offers an opportunity to develop data-driven methodologies for exploring disease mechanisms and improving the reanalysis of unsolved cases. We aggregated real-world genomic data from 11,084 unrelated patients with suspected RD. Patients were clinically classified into 122 diseases. We built a multi-disease genomic variant frequency database (FJD-DB), which enabled the development of variant and gene-disease association scores by means of case-control subcohort comparisons across 32 disease groups. Functional enrichment analyses were then used to highlight disease-associated protein domains, pathways, biological processes, and phenotypes. Finally, the resulting knowledge was integrated into a data-driven framework for the guided reanalysis of unsolved RD patients applied to Inherited Retinal Dystrophies (IRD) patients as first use case. FJD-DB contained more than 45 million unique variants, including ~185,000 potentially pathogenic variants. Disease-specific analyses identified disease-associated pathogenic variants and highlighted both established and candidate disease genes. We detected 179 significantly enriched protein domains across 23 diseases, 124 Human Phenotype Ontology terms across 13 diseases, 79 Reactome pathways across 10 diseases, and 72 Gene Ontology biological processes across 8 diseases, revealing highly disease-specific functional signatures. Integration of disease-specific variant, gene, and functional association signals enabled the development of a data-driven framework for guided reanalysis of unsolved RD cases. Applied to more than 1,100 unsolved IRD cases, the framework generated clinically relevant findings in 26 patients, including four molecular diagnoses, seven candidate diagnoses, and 15 cases upgraded from non-informative findings to variants of uncertain significance. Aggregated real-world genomic data can be leveraged to identify disease-associated molecular signals generating novel biological hypotheses. A unified analytical framework provides a scalable strategy for knowledge discovery and guided reanalysis, facilitating the identification of overlooked and potentially novel genetic causes of RDs.

3
When Algorithms Prescribe: A Cross-Sectional Study of Quality, Misinformation, and Engagement in Statin-Related Content on TikTok

Gharibyan, I.; Ahner, E.; Shao, R.; Sharma, D.; Navarsartian Tazehkand, T.; Diep, J.; Assoumou, B.

2026-06-08 health informatics 10.64898/2026.06.04.26354962 medRxiv
Top 1%
0.7%
Show abstract

Background: Statins are key to preventing atherosclerotic cardiovascular disease and lowering low-density lipoprotein cholesterol and cardiovascular events. However, skepticism regarding their safety and value persists and is increasingly influenced by social media. TikTok has emerged as a major source of health information, but its content varies in quality and accuracy. This study evaluated the quality, attitudes, misinformation, and engagement of statin-related content on TikTok. Methods: Public TikTok videos were collected using predefined search terms and coded by creator type, thematic content, and overall attitude. Video quality was assessed using the DISCERN instrument, the Patient Education Materials Assessment Tool for Audiovisual Materials, and the Global Quality Score. False or misleading claims were independently reviewed by two cardiology fellows. Associations between engagement and quality were also examined. Results: Of 1,349 screened videos, 258 met inclusion criteria. Most were educational (91.0%), with non-physician healthcare providers (34.5%) as the largest creator group. Risks or negative effects were discussed more often than benefits (63.2% vs 42.2%), and 39.5% contained at least one false or misleading claim, most often from complementary and alternative medicine providers and wellness promoters. Quality differed by creator type across all instruments, with physician-created content scoring highest. Video popularity showed minimal association with informational quality. Conclusion: Statin-related TikTok content frequently emphasizes harms, often contains misinformation, and varies substantially in quality by creator type. Greater involvement of healthcare professionals on social media may help improve digital health literacy and counter misleading information about statin therapy.

4
Investigating the Y chromosome in complex disease: Phenome-wide scan across 104,334 Finnish men

Preussner, A.; Leinonen, J. T.; FinnGen, ; Pirinen, M.; Tukiainen, T.

2026-06-10 genetic and genomic medicine 10.64898/2026.06.09.26355235 medRxiv
Top 2%
0.7%
Show abstract

Although the Y chromosome represents roughly 2% of the male genome, it is often ignored in genome-wide association studies (GWAS). Subsequently, the potential health impacts of Y-chromosomal genetic variation remain incompletely understood. To fill this gap, we performed a phenome-wide association study (PheWAS) in FinnGen across 1,426 binary and quantitative traits using Y-chromosomal variation (frequency [&ge;] 1%) in 104,334 genotyped men. As Y chromosome variation is prone to population stratification, we performed carefully adjusted association analyses and further examined these through kin-based validation in 19,275 female and 24,712 male 1st degree relatives. We found 121 suggestive (p < 5.6x10-3) phenotypic associations in the Y chromosome, yet none of these were strong enough to reach phenome-wide significance (p < 3.9x10-6). While only 38 associations were supported in the kin-based validation, intriguingly we found support for a previously suggested link between haplogroup I1 and coronary heart disease (CHD; OR=1.06, 95%CI=1.02-1.11, p=3.7x10-3; male validation OR=1.05; female validation OR=0.97). The I1-CHD association was detected across distinct geographical areas within Finland and was independent from Loss of Y (LOY) and the autosomal risk to CHD, proposing a link between germline Y-chromosomal variation and heart disease risk. Overall, this study presents a comprehensive phenome-wide analysis of Y-chromosomal associations, highlighting the potential relevance of Y-chromosomal variation beyond sex determination. Our findings further emphasize the need for improved capture of Y-chromosomal variants and further analyses in biobank-scale data to allow for deeper exploration of male-specific genetic architecture of complex diseases.

5
Reproductive health in Mexican women with systemic lupus erythematosus: pregnancy outcomes, menstrual irregularities and early menopause

Sevilla-Parra, G.; Bravo-Garcia, F.; Mier y Teran Guevara, M.; Montes-Garcia, A.; Schäfer, A.; Ochoa-Rodriguez, N.; Bienvenu Caballero, M.; Gonzalez Zenteno, S. G.; Pena-Ayala, A.; Tinajero-Nieto, L.; Torres-Valdez, E.; Martinez, D.; Hernandez-Ledesma, A. L.; Medina-Rivera, A.; Alpizar-Rodriguez, D.

2026-06-09 sexual and reproductive health 10.64898/2026.06.07.26354004 medRxiv
Top 2%
0.7%
Show abstract

Objective: To characterize pregnancy outcomes and menstrual irregularities in Mexican women with systemic lupus erythematosus (SLE) and identify clinical factors associated with adverse pregnancy outcomes and early-onset menopause. Methods: We conducted a cross-sectional study of women with SLE enrolled in the Mexican Lupus Registry (LupusRGMX) between May 2021 and September 2024. Clinical and reproductive data were collected using standardized questionnaires. Menopause was defined as the absence of menstruation for [&ge;]12 consecutive months, and early menopause as onset before age 40. Univariable and multivariable logistic regression analyses were used to identify factors associated with pregnancy complications and early menopause. Results: A total of 210 women were included. Median age was 38 years (IQR 29-46) and median disease duration was 4 years (IQR 1-10). Among women with a history of pregnancy (47%), full-term delivery predominated (61%), while pregnancy loss occurred in 26% and preterm delivery in 13%. Pregnancy complications were reported in 9.6%, most commonly preeclampsia (6.7%). Younger maternal age was independently associated with pregnancy complications (OR 0.89, 95% CI 0.83-0.95) and adverse outcomes (OR 0.95, 95% CI 0.92-0.98). Higher disease activity was associated with complications in univariable analysis. Most pregnancies (68.3%) occurred before diagnosis. Early menopause was observed in 6.2% and independently associated with longer disease duration and older age. Conclusion: Younger maternal age was independently associated with adverse pregnancy outcomes, whereas disease activity showed an association in univariable analysis. Most pregnancies occurred prior to SLE diagnosis. Early menopause was associated with longer disease duration, suggesting impact of cumulative disease burden on ovarian function.

6
Population-scale detection of methylation outliers from long-read genome sequencing

Jensen, T. D.; Kaur, R.; Bonner, D. E.; Nguyen, J.; Reuter, C. M.; Undiagnosed Diseases Network, ; Genomics Research to Elucidate the Genetics of Rare Diseases (GREGoR) Consortium, ; Ashley, E. A.; Bernstein, J. A.; Wheeler, M. T.; Montgomery, S. B.

2026-06-11 genetic and genomic medicine 10.64898/2026.06.09.26355279 medRxiv
Top 2%
0.7%
Show abstract

Background: Aberrant DNA methylation can mediate the functional effects of rare genetic variation and contribute to imprinting disorders, repeat expansion diseases, and other pathogenic regulatory mechanisms. Long-read sequencing technologies now enable genome-wide detection of CpG methylation alongside genetic variation from a single assay. However, methods for systematic identification and interpretation of methylation outliers from long-read sequencing data remain limited. Methods: We developed METAFORA, a computational workflow for detecting methylation outlier regions from PacBio and Oxford Nanopore long-read sequencing data. METAFORA constructs population-level methylation references, segments the genome into correlated CpG blocks, infers technical and biological sources of variation through hidden factor estimation, models uncertainty due to variable depth sequencing, and computes covariate-adjusted methylation outlier scores for individual samples. We applied METAFORA across large long-read sequencing cohorts and integrated methylation outliers with multi-omic data. METAFORA is implemented as a snakemake workflow available at https://github.com/tjense25/METAFORA. Results: METAFORA identified methylation outlier regions associated with rare structural variants, tandem repeat expansions, and imprinting abnormalities. We found outlier regions were enriched for molecular outliers across transcriptomic and chromatin accessibility datasets, supporting their functional relevance in gene regulation. In a representative case, METAFORA identified an imprinting defect affecting the GNAS locus associated with an STX16 deletion. Conclusions: METAFORA enables scalable detection and interpretation of methylation outliers from long-read sequencing data and provides a framework for integrating epigenetic outliers with genomic and multi-omic analyses. These approaches may improve interpretation of rare regulatory variation and support discovery of clinically relevant epigenetic abnormalities in genomic medicine.

7
Global population frequencies of NAT2 star alleles observed in three large biobanks

Sangkuhl, K.; Whirl-Carrillo, M.; Woon, M.; Venkatesh, R.; Keat, K.; Whaley, R.; Ritchie, M. D.; Klein, T. E.

2026-06-11 genetic and genomic medicine 10.64898/2026.06.09.26355281 medRxiv
Top 2%
0.5%
Show abstract

NAT2 is an important pharmacogene which encodes the N-acetyltransferase 2 enzyme that is involved in the metabolism of multiple medications, and variants in this gene can affect patient response to these medications. CPIC has published a clinical guideline for prescribing hydralazine using NAT2 genotypes. Just prior to the guideline, updated NAT2 star allele numbering and definitions were released, differing somewhat from the historical nomenclature. Clinical pharmacogenomic testing panels often test for the most common star alleles, so knowledge of the most common updated NAT2 star alleles is critical for the implementation of the CPIC NAT2/hydralazine guideline. We first determine NAT2 diplotype frequencies from UK Biobank (UKBB) 200k phased genomes, then analyzed allele, diplotype, and phenotype population frequencies from the All of Us Research program, PennMedicine BioBank (PMBB) and UKBB 500k datasets. We found that analyzing NAT2 diplotypes from phased data provides critical information for algorithms designed to predict diplotypes from unphased data. We observed that NAT2*5, *6, and *4 were the most common star alleles in that order, and the top 11 most frequent NAT2 star alleles were the same across all biobanks. However, differences in star allele frequencies across biogeographical populations were observed. The largest difference led to a higher frequency of NAT2 poor metabolizer phenotypes as compared to rapid and intermediate metabolizer phenotypes in all global populations except in the EAS population, where NAT2 poor metabolizers were in the minority.

8
Polygenic risk scores associate with asthma phenotypes and proteomic analyses implicate IL1R1 in two family-based studies

Lee, S.; Moll, M.; Mendez, K.; Prince, N.; Lasky-Su, J.; Lutz, S. M.; Weiss, S. T.; Lange, C.; Kelly, R. S.; Hecker, J.

2026-06-11 genetic and genomic medicine 10.64898/2026.06.06.26355045 medRxiv
Top 2%
0.4%
Show abstract

Despite its high prevalence and the discovery of hundreds of genetic associations, the genetic determinants and heterogeneous manifestations of asthma remain incompletely understood. Incorporating polygenic risk scores (PRS) into asthma research offers a powerful approach to quantify inherited susceptibility, refine risk profiles, and advance mechanistic understanding of disease development. For this study, we leveraged whole-genome sequencing (WGS) data from two family-based cohorts of childhood asthma - the Genetics of Asthma in Costa Rica Study (GACRS) and the Childhood Asthma Management Program (CAMP) - to examine the transmission profiles of externally derived asthma PRS and their associations with clinical phenotypes in children with asthma. To further elucidate molecular mechanisms, we integrated large-scale external genome-wide association study (GWAS) summary statistics and genetic prediction models of protein abundance in a two-step proteome-wide association study (PWAS) of asthma. Our findings provide robust evidence supporting the validity of externally derived asthma PRS (asthma PRS association p-value p={10}^{-24} [GACRS and CAMP trios combined] for the Global Biobank Meta-analysis Initiative [GBMI]) and reveal consistent associations with spirometry measures and atopy markers across both studies, as 13 of 21 traits (62%) were significantly associated with the GBMI-PRS in the meta-analysis after multiple-testing correction. Moreover, the results of the integrative proteomic analysis implicate IL-1 signaling in the etiology of asthma, reinforcing the candidacy of IL1R1 antagonists for drug repurposing.

9
Healthcare professionals' perspectives on a multilevel cardiovascular risk management intervention (PROSPERA programme)

Bongaerts, V. A. M. C.; van Gestel, L. C.; van Peet, P. G.; Vuijk, M.-L. S.; Hageman, S. H. J.; Dorresteijn, J. A. N.; Bonten, T. N.; Numans, M. E.; van Os, H. J. A.; Vos, R. C.

2026-06-09 cardiovascular medicine 10.64898/2026.06.08.26355169 medRxiv
Top 2%
0.4%
Show abstract

Background: Two-thirds of Dutch cardiovascular risk management (CVRM) for patients at risk of cardiovascular disease is delivered in primary care practices. While individual risk scores are increasingly used during consultation, a population-level structure for risk-based patient outreach is not currently available. We therefore developed the PROSPERA programme, a multilevel intervention comprising population-level risk stratification and individual-level support tools. Aim: To assess anticipated and experienced barriers and facilitators among healthcare professionals (HCPs) to inform implementation in primary care. Methods: We conducted four focus groups and six interviews with nine primary care HCPs to explore anticipated and experienced barriers and facilitators. Inductive codes were thematically analysed and assigned to corresponding domains of the Theoretical Domains Framework (TDF) and the related Capability, Opportunity, Motivation model of Behaviour. Results: Barriers and facilitators were identified in 11 TDF domains. Population-level barriers included altered professional roles and limitations in technological infrastructure. Individual-level barriers were limited skills in interpreting risk calculations and difficulty integrating tools into clinical routine. Facilitators were related to beliefs on the importance of providing proactive care (population level), the use of U-Prevent for risk communication (individual level) and positive patient responses to the Lifestylecheck questionnaire (individual level). Conclusion: Addressing barriers and facilitators identified at both the population and individual levels can support implementation of the PROSPERA programme. Opportunities exist in education and training of HCPs in risk communication, as well as support in restructuring the physical and digital environment.

10
Next-Generation Skin Cancer Detection Using Efficient Fuzzy Fusion of Genomic and Imaging Data

Molla, A. R.; Maity, A.; Saha, S.; Bhattacharya, R.; Chakraborty, A.; Biswas, S.; Nath, S.

2026-06-08 health informatics 10.64898/2026.06.05.26355024 medRxiv
Top 3%
0.4%
Show abstract

Skin cancer requires early detection for improved survival rates. Most existing methods rely on deep learning based image classification, which is affected by visual similarity among lesions. Fewer studies use Gene Expression (GE) analysis, which captures molecular characteristics but lacks structural and visual details. To overcome limitations of individual modalities, this paper proposes a multimodal framework integrating dermoscopic images and GE profiles for skin cancer classification. EfficientNet and logistic regression are used for image based analysis and genomic skin lesion profiling, respectively, followed by fuzzy rule based decision systems to reduce uncertainty within individual modalities. Finally, fuzzy fusion combines predictions from both modalities using uncertainty based weighting of classifier outputs. The experimental findings show that both the image based and GE based classification models individually achieved accuracies of nearly 92%. However, the integration of prediction results through the proposed fuzzy fusion strategy further enhanced the classification performance, achieving an overall accuracy of 94.25%. The results obtained outperform contemporary methods, highlighting the effectiveness of combining complementary multimodal information compared with single modality approaches.

11
Plasma protein prioritisation in rheumatoid arthritis reveals druggable targets and shared biology with cardiovascular diseases

Alduhayhi, S. S.; Morris, A. P.; Zhao, S.; Bowes, J.

2026-06-11 epidemiology 10.64898/2026.06.10.26355332 medRxiv
Top 3%
0.3%
Show abstract

Abstract Background Rheumatoid arthritis (RA) is an autoimmune inflammatory disease with complex and incompletely understood molecular mechanisms. Understanding circulating proteins associated with RA may improve understanding of disease biology and clarify its pathological links with cardiometabolic comorbidities. Methods A proteome-wide two-sample Mendelian randomisation (MR) drug target analysis was conducted using plasma proteins measured in 54,219 participants from the UK Biobank Pharma Proteomics Project as exposures and RA and cardiometabolic diseases as the outcomes. Summary statistics for RA included 53,663 cases and 1,070,200 controls. Colocalisation analysis was performed to confirm shared single causal variants and prioritise RA proteins supported by both MR and colocalisation. The prioritised proteins were then evaluated in the Accelerating Medicines Partnership RA Phase II synovial single-cell dataset for cell-type expression patterns. Druggability was then assessed followed by analysis of genetic overlap between RA-associated proteins and cardiometabolic diseases. Results 37 plasma proteins had a causal effect on RA risk, supported by combined evidence from MR and conditional colocalisation. In synovial tissue, TPPP3, RARRES2, AKAP12, and GGT5 were predominantly expressed in stromal and endothelial cell clusters. Druggability assessment identified IFNGR2, IL6R, CD40, and FCGR2B as Tier 1 targets. However, several biologically relevant proteins, including RARRES2, AKAP12, TPPP3, and SNX2, had limited available druggability data. Genetic overlap analysis demonstrated shared protein signals between RA and cardiovascular diseases, including overlap of RARRES2 and TPPP3 with coronary artery disease (CAD) and FCGR2B with atrial fibrillation (AF). To approximate the therapeutic effect of target inhibition, the direction of effect estimates for proteins showing overlap between RA-CAD and RA-AF was reversed. Conclusion This study identified circulating proteins involved in RA pathogenesis and reveals shared mechanisms between RA and cardiovascular diseases. While some proteins showed clear translational potential targets, several prioritised proteins had limited available druggability information and could not be confidently classified. Addressing these gaps may help identify new targets relevant to RA management. Future work should also use phenome-wide MR studies to evaluate potential on-target adverse effects of protein inhibition across RA-CAD and RA-AF.

12
Incremental Clinical Value of Single-Molecule Nanopore Sequencing in Thalassemia Testing: A Prospective Double-blind, Multicenter Study

Xiang, J.; Zhu, B.; Xu, H.; Chen, Y.; Sun, X.; xiang, r.; Zhao, Y.; Liu, W.; Zhang, L.; He, J.; liu, j.; Chen, Y.; Fan, Z.; Zhang, H.; Tan, J.; Pang, L.; Shi, L.; Kong, Y.; Cai, A.

2026-06-09 hematology 10.64898/2026.06.09.26354559 medRxiv
Top 3%
0.3%
Show abstract

Background Thalassemia is one of the most common monogenic disorders worldwide, current screening strategies combining hematological testing with molecular assays still carry a risk of missed diagnoses and undesirable efficiency, particularly for complex structural variants and rare mutations. Methods In this prospective double-blind, multicenter cohort study of 3,842 participants (3,362 pregnant women and 480 male partners), we conducted a head-to-head comparison to systematically evaluate the incremental clinical value and detection performance of single-molecule nanopore sequencing in thalassemia (SMITH) against conventional hematological testing and next-generation sequencing (NGS). Findings The overall concordance rate between NGS and SMITH was 98.6% (3789/3842). The discrepant cases (n=53) were directly attributed to the superior detection capabilities of SMITH, which successfully identified complex structural rearrangements-including 45 -globin gene triplications and four HK alleles-that were missed by NGS. Furthermore, SMITH accurately detected four rare variants (c.134_135insT/, c.-22(C>T)/, {beta}N/{beta}c.316-290delinsAGGGCAATAATTT and {beta}3.5 kb deletion/{beta}N ) and resolved ten trans and three cis configurations within the globin gene allele. Clinically, these technical advantages translated to a 9.3% (5/54) increase in the detection rate of high-risk prenatal couples, effectively preventing one birth affected by moderate-to-severe thalassemia. Additionally, SMITH corrected a diagnostic discrepancy in one case (HK vs. -3.7), sparing the couple from an unnecessary invasive procedure. Interpretation Our findings demonstrate that SMITH provides a powerful platform for resolving globin gene rearrangements, detecting rare variants, and enabling direct haplotype phasing. By effectively eliminating diagnostic blind spots, SMITH is expected to become an optimal method for thalassemia prevention programs. Funding This study was supported by Chinese National Natural Science Foundation Projects 81760037 and 82271894.

13
Perceived Social Support and Self-Efficacy as Mediators Between Health Literacy and Quality of Life Among Middle-Aged and Older Adults with Hypertension: A Cross-Sectional Study in Six Central Provinces of China

Zhao, Y.; Yun, Y.; Bai, T.; Xiong, L.; Ruan, Y.; Zhao, H.; Wang, W.; Wang, F.

2026-06-08 public and global health 10.64898/2026.06.06.26355051 medRxiv
Top 5%
0.2%
Show abstract

Abstract Objective: The onset of hypertension occurs at a younger age in China, and the relationship between health literacy and quality of life among middle-aged and older hypertensive patients remains unclear. This study explored whether perceived social support and self-efficacy mediate the association between health literacy and quality of life in middle-aged and older hypertensive patients. Methods: A questionnaire was administered to 1,015 middle-aged and older hypertensive adults from communities in six central provinces of China. The EQ-5D scale, Perceived Social Support (PSS) scale, Self-Efficacy Scale (SES), and Health Literacy Scale (HLS) were used to assess quality of life, social support, self-efficacy, and health literacy, respectively. Mplus 8.3 software was used to construct a structural equation model for path analysis. Results: The mean PSS, SES, HLS, EQ-5D, and EQ-VAS scores were 15.57{+/-}3.45, 10.61{+/-}2.41, 9.49{+/-}2.86, 0.88{+/-}0.18, and 71.06{+/-}17.49, respectively. Health literacy and quality of life scores significantly differed among middle-aged and older hypertensive patients, and both showed positive correlations with perceived social support and self-efficacy (both P<0.001). Perceived social support and self-efficacy exhibited a chain mediated effect on the relationship between health literacy and quality of life (EQ-5D utility index and EQ-VAS), accounting for 28.57% of the total effect of the EQ-5D utility index and 27.26% of that of the EQ-VAS. This study is the first to elucidate the mechanism by which health literacy influences quality of life in middle-aged and older hypertensive patients through the chain-mediated effect of perceived social support and self-efficacy. Conclusion : Health literacy is significantly correlated with quality of life in middle-aged and older hypertensive patients. This correlation can directly or indirectly explain the impact on quality of life through mediating pathways involving perceived social support and self-efficacy. Keywords: hypertensive patients, perceived social support, self-efficacy, health literacy, quality of life, mediating effect

14
Epidemiology of Cervical Precancerous Lesions: Prevalence and Predictors from Pap Smear Screening in Hawassa City Hospitals, Sidama Region, Ethiopia. Institutional-Based Cross-sectional Study

Fisshatsion, A. B.; Zewude, Y. A.; Nisro, A. M.; Abebe, R. F.

2026-06-10 public and global health 10.64898/2026.06.09.26355254 medRxiv
Top 5%
0.2%
Show abstract

Background: Cervical cancer is the fourth most common cancer in women worldwide and remains a major public health challenge. In Ethiopia, it is the second leading cause of cancer deaths, with around 8,000 new cases and 6,000 deaths each year. Region?specific data on the prevalence and predictors of precancerous lesions remain scarce, yet such information is vital for guiding targeted reproductive health strategies. This study therefore examined the prevalence and predictors of cervical precancerous lesions among women aged 21-60 years undergoing Pap smear screening in public hospitals in Hawassa City, Sidama Region. Methods: An institution-based cross-sectional study was conducted among 241 women attending Pap smear screening at public hospitals in Hawassa City from March to August 2025. Sociodemographic and clinical data were collected via interviews and medical records. Lesions were classified based on the standardized international framework for reporting cervical cytology results from Pap smears per the Bethesda system. Multivariable logistic regression identified predictors p<0.05). Result: Of 241 women screened (mean age 35.3 years), cervical epithelial abnormalities were detected in 52 (prevalence 21.6%). Atypical squamous cells of undetermined significance was the most common abnormality (16.6%). Multivariable analysis showed HIV infection was significantly associated with precancerous lesions (AOR = 3.7, 95% CI: 1.69-8.12, p<0.05), while hormonal contraceptive use was protective (AOR = 0.27, 95% CI: 0.11-0.67, p<0.05). Conclusion: These results underscore the urgent need to strengthen cervical cancer prevention through targeted screening and early intervention. Integrating routine HIV testing with Pap smear programs would be especially valuable. Health authorities should expand accessible screening for women aged 21-60, with particular attention to those living with HIV, to help reduce the burden of precancerous lesions.

15
Beyond event-rate enrichment: proteomic risk scores for mechanism-aware prevention trial design

Fieggen, J.; Simond, G.; Segal, B. M.; Noori, A.; Thakurta, A.; Butler, C. C.; Clifton, D. A.; Clifton, L.

2026-06-10 health informatics 10.64898/2026.06.09.26355266 medRxiv
Top 6%
0.2%
Show abstract

Background. Blood-based biomarkers are increasingly proposed for identifying high-risk individuals before clinical disease and for making prevention-oriented trials more efficient. Prognostic enrichment can increase event rates, but trial efficiency also depends on whether the intervention effect is preserved in the enriched population. Methods. Using the UK Biobank Pharma Proteomics Project, we trained disease-specific proteomic risk scores (ProRS) from 2,916 plasma proteins with elastic-net Cox models. We compared ProRS, polygenic risk scores (PRS), and combined PRS--ProRS scores across ten incident diseases. We estimated cumulative incidence and theoretical two-arm time-to-event trial sample sizes across risk strata. To evaluate effect preservation, we examined six intervention-analogue exposure--outcome pairs spanning genetic (PCSK9/coronary artery disease, APOE/Alzheimer's disease, PPARG/type 2 diabetes, IL23R/Crohn's disease), behavioural (physical activity/all-cause mortality), and pharmacological (RAAS inhibitors versus calcium channel blockers/coronary artery disease) examples. Results. ProRS outperformed PRS for 9 of 10 diseases (median C-index 0.75 versus 0.61). ProRS and PRS were weakly correlated (median Pearson |r| = 0.04), and joint PRS--ProRS stratification identified groups with higher observed incidence than either score alone for several endpoints. In the top risk quartile, combined-score enrichment reduced theoretical required sample sizes by 32--74\% under a fixed 20\% relative hazard reduction. These gains were not always preserved when stratum-specific intervention-analogue effects were used. Effects were broadly preserved for APOE/Alzheimer's disease and physical activity/mortality. The PPARG/type 2 diabetes effect attenuated toward the null under all three score types, showing that event-rate enrichment does not guarantee effect preservation. For IL23R/Crohn's disease and the antihypertensive comparison, point estimates differed across score types -- preserved under polygenic but attenuated under proteomic enrichment -- but confidence intervals were wide and overlapping. Conclusions. Proteomic risk scores can identify high-event-rate populations for prevention-oriented trials, but event-rate enrichment alone is insufficient for trial design. Biomarker-guided enrichment should evaluate mechanism-specific effect preservation and may be preferable as a stratification or adaptive-design variable rather than as a restrictive eligibility criterion.

16
Knowledge, attitudes and practices regarding risk factors for cardiovascular disease among women in an urban slum of Kathmandu, Nepal: A cross-sectional study.

Kasaju, M.; Shrestha, A. P.; Oli, N.; Vaidya, A.

2026-06-08 public and global health 10.64898/2026.06.04.26354909 medRxiv
Top 6%
0.2%
Show abstract

Introduction: Cardiovascular diseases (CVDs) are the leading cause for death and disability worldwide accounting for 75% of deaths in low- and middle-income countries (LMICs) like Nepal. Urbanization and globalization remains the major cause of rise in CVDs among urban poor population along with growth in slum settlements. This study aims to assess the knowledge, attitude and practice (KAP) of CVDs and its risk factors among women of one such urban poor community in Nepal. Methodology: This cross-sectional study (n=388) in the Sinamangal-Minbhawan slum area was conducted using semi structured questionnaire based on STEPs survey and HARDIC study among the participants selected through convenient sampling. Descriptive analysis was done using SPSS version 21 and KAP scores were further categorized based on median score to perform multivariate logistic analysis. Additionally, Anthropometric and blood pressure measurements were also recorded and analyzed. Results: The median age (Interquartile range) of participants was 33 years (17) with majority of them being Dalit by ethnicity, housewives, with up to primary level education belonging to upper lower socioeconomic class. More than half (53.3%) of the participants were obese and over 23% were hypertensive. While half of the hypertensive women were aware of their status, only 3% had their blood pressure under control.The median knowledge, attitude and practice (KAP) scores were 12, 60 and 10 respectively. The KAP scores were positively associated with socioeconomic status of the participants. Conclusion: The study revealed low knowledge with high prevalence of behavioral risk factors of CVDs along with high prevalence of other metabolic risk factors like high body mass index, high waist hip ratio and hypertension among women of slum area with a positive attitude to prevent CVDs and its risk factors.

17
Metatranscriptomics-Derived Disease Risk Scores as a Preventive, Diagnostic, and Treatment Support Tool

Hu, L.; Bass, M.; Patridge, E.; Molusky, M.; Antoine, G.; Vuyisich, M.; Banavar, G.

2026-06-06 genetic and genomic medicine 10.64898/2026.05.29.26354333 medRxiv
Top 6%
0.1%
Show abstract

Background: Chronic diseases and symptom syndromes often develop after prolonged biological changes that may precede formal diagnosis. RNA-based metatranscriptomics captures active microbial and human gene expression and may provide a functional layer for disease risk evaluation. To address this translational gap, we developed and validated a Disease Risk Score (DRS) framework that integrates metatranscriptome-derived pathway activity scores from stool, saliva, and blood samples, and evaluated its potential clinical utility as an adjunct risk-evaluation tool. Methods: DRS uses disease-specific sets of pathway activity scores derived from stool and saliva microbial functions, stool and saliva microbial taxa, and blood human gene expression. For each disease, 'not optimal' pathway scores are aggregated into a normalized cumulative odds ratio, or cOR, using score-level odds ratios, statistical significance, and literature-supported biological relevance derived from a Development Cohort of 22,369 individuals. A cOR [&ge;] 5 is defined as high risk. Performance is evaluated in an independent Validation Cohort of 15,908 individuals using self-reported diseases as the reference. Disease support requires both significant cOR separation between self-reported and not-reported (Cohen's d [&ge;] 0.2) and risk ratio enrichment of self-reported disease among individuals classified as high risk (95% CI of Risk Ratio > 1). Results: Of 20 initially evaluated diseases, 15 meet the prespecified validation criteria on the independent validation cohort: ADHD, anxiety, chronic fatigue syndrome, depression, GERD, hypertension, inflammatory bowel disease, IBS-C, IBS-D, insomnia, MASLD, obesity, obstructive sleep apnea, Sjogren's syndrome, and type 2 diabetes. Five selected clinical scenarios illustrate how DRS can support clinician-mediated decision making, including IBS subtype reclassification, improved diagnostic acceptance in IBS-D, personalized lifestyle counseling in MASLD and early type 2 diabetes, and diagnostic uncertainty in atypical GERD. Conclusions: DRS is a metatranscriptomics-based risk-stratification framework that aggregates active microbial and human pathway signals into interpretable disease-specific risk estimates across a wide range of disease conditions. Validation against self-reported disease labels in an independent cohort shows significant risk enrichment for each of 15 diseases. DRS is intended as an adjunct to clinical evaluation: a decision support tool in situations where routine care encounters uncertainty, delay, or low patient engagement. Future prospective studies using clinically adjudicated endpoints are needed to assess calibration and clinical outcomes.

18
Large-scale proteomics and timing of hypertensive disorders of pregnancy

Hauspurg, A.; Huang, X.; Greenland, P.; Pemberton, V.; Bairey Merz, C. N.; Saade, G. R.; Yee, L. M.; Levine, L. D.; Ranzini, A.; Haas, D. M.; Hoffman, M.; Lau, E.; Khan, S. S.; Kleiboeker, B.; Reddy, U. M.; Catov, J. M.; Grobman, W.

2026-06-11 obstetrics and gynecology 10.64898/2026.06.09.26355317 medRxiv
Top 7%
0.1%
Show abstract

Background: Hypertensive disorders of pregnancy (HDP) may first be diagnosed antepartum, during labor, or postpartum. We utilized untargeted large-scale proteomics to identify pathways associated with HDP based on timing of onset. Methods: We performed a nested case-control study comparing differential protein expression, from the SomaScan 7K platform, based on timing of onset of HDP versus controls (referent) using first-trimester samples from the NuMoM2b-Heart Health Study, a multi-site cohort that followed nulliparous individuals from the first trimester. Associations of proteins with timing of onset of HDP, adjusted for co-variates, were assessed using logistic regression q value-based false discovery rates and pathway enrichment and differential expression analysis were conducted. Results: Of 1628 individuals included, 678 had HDP, of which 67% manifested antepartum (AP), 29% intrapartum (IP), and 3% postpartum (PP). After adjusting for co-variates, compared to controls, 698 proteins, 39 proteins, and 144 proteins were differentially expressed in those with HDP according to AP, IP, PP onset, respectively. There was little overlap in individual protein expression based on timing of HDP. Pathway enrichment and graphical summary analyses suggested distinct processes. Specifically, there was downregulation of angiogenic proteins in AP HDP, downregulation of immune-related proteins in IP HDP, and upregulation of complement activation promoting fibrotic changes leading to cardiac dysfunction in PP HDP. Conclusion: There are differences in first-trimester protein expression based on whether HDP first manifests AP, IP or PP. This raises the possibility that there may be distinct mechanistic phenotypes that could uniquely inform diagnostic and therapeutic targets for HDP.

19
Genotype is a predictor of blood pressure variability and relative systemic hypertension risk in sickle cell disease

Bowers, A. S. A.; Henry, K.; McConnell, B.; Francis, C.; Thaxter-Nesbeth, K.

2026-06-10 hematology 10.64898/2026.06.06.26355049 medRxiv
Top 7%
0.1%
Show abstract

Background Blood pressure (BP) regulation in individuals with sickle cell disease (SCD) is influenced by a complex interplay of genetic and physiological factors. While SCD has traditionally been associated with lower BP, there is an increased risk of hypertension. Emerging BP research suggests significant heterogeneity across genotypes, age groups, and sex. Objectives: This study investigated the longitudinal effects of population-level characteristics and continuous clinical and laboratory predictors on systolic (SBP) and diastolic blood pressure (DBP) in individuals with SCD, with emphasis on the interactions between baseline and predicted blood pressure slopes over time. Methods We retrospectively analyzed longitudinal data from a cohort of 2,739 patients with diverse SCD genotypes. Descriptive statistics were documented across sex, age range, genotype, health status and relative systemic hypertension risk categories (rHTN-risk). Linear mixed-effects models provided estimates of fixed- and random-effects of baseline BP and of time-related BP effects, respectively. Post-estimation margins provided contrasts of baseline-adjusted BP means and of pre-specified time effects on BP patterns. Results Males had significantly higher baseline SBP ({beta} = 6.64, p < 0.001) but lower baseline DBP ({beta} = -2.61, p < 0.001) compared with age-matched HbSS females. Baseline SBP was more unstable compared with baseline DBP and baseline DBP was more predictive of future BP trends than baseline SBP. Genotype was a consistent predictor of DBP (p < 0.05), but not of SBP. Similarly, we observed increased risks of relative diastolic hypertension across most genotypes, while the prevalence and magnitude of systolic hypertension was lower across all genotype compared with HbSS. Conclusions Blood pressure trajectories in SCD patients are not uniform and are significantly related to genotype, age group and sex over time. Baseline diastolic levels were less heterogenous and exhibited clear upward trajectories over time. These findings support the need for patient-specific BP surveillance in the care and management of SCD.

20
Medical discrimination and the selective erosion of institutional health trust: evidence from the Health Information National Trends Survey 6 and 7

Park, A.; Yin, L.; Wong, A.; Lee, C.; Choi, Y.

2026-06-09 public and global health 10.64898/2026.06.06.26355057 medRxiv
Top 8%
0.1%
Show abstract

Medical discrimination may alter how patients relate to health information sources following adverse care encounters. We examined whether discrimination experience is associated with selective erosion of institutional health trust and with compensatory digital health engagement, using nationally representative data from the Health Information National Trends Survey (HINTS) 6 (2022; n=6,252) and HINTS 7 (2024; n=7,278). Survey-weighted modified Poisson regression estimated prevalence ratios (PRs) for binary high-trust outcomes, and survey-weighted ordinary least squares estimated coefficients for continuous outcomes; jackknife replicate weights (50 replicates) provided variance estimates. Discrimination was associated with substantially lower probability of high trust in the healthcare system (PR=0.39; 95% CI 0.30-0.52) and physicians (PR=0.85; 95% CI 0.77-0.94), with no significant association for trust in scientists, government, family, or religious organisations. The clinical-institutional pattern replicated in HINTS 6, which additionally showed reduced trust in scientists for race/ethnicity-based discrimination. Contrary to a disengagement hypothesis, discrimination-exposed adults showed higher probability of online health information seeking (PR=1.06), health app use (PR=1.11), and online provider messaging (PR=1.13); these associations persisted after adjustment for trust in physicians. Discrimination was independently associated with lower health self-efficacy (b=-0.271). Medical discrimination selectively erodes trust in clinical institutions while leaving broader epistemic trust largely intact. Despite this, discrimination-exposed patients engage more actively with digital health channels, consistent with compensatory reorientation toward non-clinical information sources. These findings describe engaged but institutionally alienated patients, with implications for restoring clinical trust and for equity-centred digital health design.